Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ci: use much faster D: drive for TEMP on Windows #13129

Merged
merged 1 commit into from
Dec 29, 2024

Conversation

ichard26
Copy link
Member

@ichard26 ichard26 commented Dec 26, 2024

This is apparently an inherent limitation of Azure (which powers GHA) which uses a slow C: drive for the OS (read-optimized) and a fast D: drive for working space.

I first thought a Dev Drive would be even faster (#13123), after a lot of testing, it seems to be slower than simply moving TEMP to the D: drive. To prove this, I did a run where the entire test suite was run back to back three times:

  • First run: TEMP=D:/Temp
  • Second run: TEMP on ReFS/Dev Drive
  • Third run: TEMP=C:/Temp aka the status quo on main

image

This is apparently an inherent limitation of Azure (which powers GHA)
which uses a slow C: drive for the OS (read-optimized) and a fast D:
drive for working space.
@ichard26 ichard26 added the skip news Does not need a NEWS file entry (eg: trivial changes) label Dec 26, 2024
@ichard26
Copy link
Member Author

ichard26 commented Dec 26, 2024

For reference, here are the average CI runtimes for the last 100 successful runs on main:

[3.8 (1)] mean: 0:17:35 min: 0:16:38
[3.8 (2)] mean: 0:15:00 min: 0:13:46
[3.13 (1)] mean: 0:11:15 min: 0:09:57
[3.13 (2)] mean: 0:10:44 min: 0:09:22

image

@pfmoore
Copy link
Member

pfmoore commented Dec 26, 2024

The script tools/ci/devdrive.ps1 creates the dev drive at C:/pip_dev_drive.vhdx, which is the slow disk (and there's a message saying dev drives aren't supported, which might indicate C: is unsuitable in other ways as well). Is it faster if you have New-VHD -Path D:/pip_dev_drive.vhdx?

@ichard26
Copy link
Member Author

@pfmoore Sorry there's a lot of context I forgot to include. The Dev Drive PR is out of date. The final Dev Drive patch I tested is included on this branch: https://github.com/pypa/pip/tree/refs/heads/d-drive (specifically 483859b).

In that version, the VHDX is created on the D: drive as best for performance.

$OSVersion = [Environment]::OSVersion.Version
$Partition = New-VHD -Path D:/pip_dev_drive.vhdx -SizeBytes $size |

As well, it is true that Dev Drives aren't supported on the current Windows 2019 runners. I wrote in that check 🙂. However, I was talking to Steve Dower and he said that simply using a ReFS drive should get you 90% of the way there:

You might also be on an OS that doesn't have it yet and so it's disabled (ReFS on a separate drive should give you most of the benefits though - all Dev Drive really adds on top of that is reducing Windows Defender's impact, but that'll be turned off on GHA already).

If you look at the run I linked earlier (which is hard to read admittedly), you'll notice that last unit/integration test steps are done on the ReFS drive, while the first test steps are done on the D: drive. The D: drive beats the ReFS drive on D: handily.

@ichard26 ichard26 requested review from pradyunsg and removed request for pradyunsg December 27, 2024 01:59
@ichard26
Copy link
Member Author

CI is 🍏 and the Windows jobs are indeed much faster. This is a dead simple change, and I've confirmed already that Dev Drives seem to be worse than simply moving to the D: drive for us. I haven't heard any complains or concerns, so I'll merge this now.

@ichard26 ichard26 merged commit f8f0f5a into pypa:main Dec 29, 2024
31 checks passed
@ichard26 ichard26 deleted the use-d-drive branch December 29, 2024 18:13
@ichard26
Copy link
Member Author

ichard26 commented Jan 5, 2025

Plotting the last 40 successful CI runs on main, it's evident that the Windows runtimes are much better 🎉

image

And the averages since merge so far.

                   Last 5 CI runs on main                   
                                                            
  Job                          Mean    Minimum   Stdev      
 ────────────────────────────────────────────────────────── 
  tests / 3.13 / Windows / 2   9:32    9:27      0:04 (0%)  
  tests / 3.13 / Windows / 1   9:53    9:46      0:09 (1%)  
  tests / 3.8 / Windows / 2    10:44   10:27     0:12 (1%)  
  tests / 3.8 / Windows / 1    12:41   12:25     0:12 (1%) 

@ichard26
Copy link
Member Author

ichard26 commented Jan 5, 2025

Scaling the y-axis more appropriately, it's clear that it's the 3.8 jobs that are primarily benefiting from the change. The 3.13 jobs seem to be 10-15% faster.

image

zanieb added a commit to astral-sh/uv that referenced this pull request Jan 17, 2025
When using the standard Windows runners (as opposed to the _larger_
GitHub runners), an undocumented `D:` drive is available and performant.
We can save some money on by using this on a standard runner instead of
a larger runner with an ReFS drive. Switching to the `D:` drive was not
acceptable for `cargo test` >25m runtime.

Inspired by pypa/pip#13129
See actions/runner-images#8755

Timings (grain of salt — GitHub is super noisy):

- clippy: 2m 18s -> 2m 11s
- build binary: 2m 3s -> 2m 35s
- trampoline check (x86-64): 2m 32s -> 1m 50s (other architectures
similar)
- trampoline test (x86-64): 4m 12s -> 6m 7s
- trampoline test (i686): 6m 44s -> 5m 35s
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Jan 20, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
skip news Does not need a NEWS file entry (eg: trivial changes)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants